Experimental Investigation into Alignment-based Acoustic Confidence Measures in Keyword Verification for Mandarin Speech
نویسندگان
چکیده
This paper introduces the methods to verify keyword hypothesis using alignment information, which can be easily implemented and integrated into the keyword spotting system in a straightforward way. Acquiring alignment information goes without any extra model training, and has no evident total processing time impact on original keyword spotting system. Different alignment-based confidence measures are evaluate using ROC curves. In addition, we introduce a novel evaluation metric, Confidence Error Cost Function (CECF), which improves upon Confidence Error Rate (CER). It is more general than CER since it takes different cost weights of FA and FR into consideration. We present the result of experiments aiming at assessing the quality and the limitations of different confidence measures and find out that the combination of minimum alignment cost and state duration normalized posterior probability gives the best performance.
منابع مشابه
Articulatory-feature-based confidence measures
Confidence measures are computed to estimate the certainty that target acoustic units are spoken in specific speech segments. They are applied in tasks such as keyword verification or utterance verification. Because many of the confidence measures use the same set of models and features as in recognition, the resulting scores may not provide an independent measure of reliability. In this paper,...
متن کاملIntegration of phonetic and prosodic information for robust utterance verification - Vision, Image and Signal Processing, IEE Proceedings-
Mandarin speech is known for its tonal charactcristic, and prosodic information plays an important role in Mandarin speech recognition. Driven by this propcrty, phonetic and prosodic information are integrated and used for Mandarin telephone speech keyword spotting. A two-stage strategy, with recognition followed by verification, is adopted. For keyword recognition, 132 subsyllable models, two ...
متن کاملUtterance Verification Using Prosodic Information for Mandarin Telephone Speech Keyword Spotting - Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference o
In this paper, the prosodic information, a very special and important feature in Mandarin speech, is used for Mandarin telephone speech utterance verification. A two-stage strategy, with recognition followed by verification, is adopted. For keyword recognition, 59 context-independent subsyllables, i.e., 22 m s and 37 FINAL’S in Mandarin speech, and one backgroundkilence model, are used as the b...
متن کاملDistributed Chinese Keyw Verification for Spoken Wireless Envir
With the rapid developments of wireless communications, it is highly desired for users to access the network information with spoken dialogue interface via hand-held devices at any time, from anywhere. One possible approach towards this goal is to perform speech feature extraction at the hand-held devices (the clients) and have all other recognition tasks and dialogue functions absorbed by the ...
متن کاملUtterance verification using prosodic information for Mandarin telephone speech keyword spotting
In this paper, the prosodic information, a very special and important feature in Mandarin speech, is used for Mandarin telephone speech utterance verification. A two-stage strategy, with recognition followed by verification, is adopted. For keyword recognition, 59 context-independent subsyllables, i.e., 22 INITIAL’s and 37 FINAL’s in Mandarin speech, and one background/silence model, are used a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006